In mathematics, the total variation identifies several slightly different concepts, related to the (local or global) structure of the codomain of a function or a measure. For a real-valued continuous function , defined on an interval [a, b] ⊂ ℝ, its total variation on the interval of definition is a measure of the one-dimensional arclength of the curve with parametric equation x ↦ (x), for x ∈ [a,b].
Contents |
The concept of total variation for functions of one real variable was first introduced by Camille Jordan in the paper (Jordan 1881).[1] He used the new concept in order to prove a convergence theorem for Fourier series of discontinuous periodic functions whose variation is bounded. The extension of the concept to functions of more than one variable however is not simple for some reasons.
real-valued (or more generally complex-valued) function , defined on an interval ⊂ℝ is the quantity
The total variation of awhere the supremum runs over the set of all partitions of the given interval.
open subset of ℝn. Given a function belonging to , the total variation of in is defined as
Let be anwhere is the set of continuously differentiable vector functions of compact support contained in , and is the essential supremum norm. Note that this definition does not require that the domain ⊆ℝn of the given function is a bounded set.
Following Saks (1937, p. 10), consider a signed measure on a measurable space : then it is possible to define two set functions and , respectively called upper variation and lower variation, as follows
clearly
The variation (also called absolute variation) of the signed measure is the set function
and its total variation is defined as the value of this measure on the whole space of definition, i.e.
Saks (1937, p. 11) uses upper and lower variations to prove the Hahn–Jordan decomposition: according to his version of this theorem, the upper and lower variation are respectively a non-negative and a non-positive measure. Using a more modern notation, define
Then and are two non-negative measures such that
The last measure is sometimes called, by abuse of notation, total variation measure.
If the measure is complex-valued i.e. is a complex measure, its upper and lower variation cannot be defined and the Hahn–Jordan decomposition theorem can only be applied to its real and imaginary parts. However, it is possible to follow Rudin (1966, pp. 137–139) and define the total variation of the complex-valued measure as follows
The variation of the complex-valued measure is thewhere the supremum is taken over all partitions of a measurable set into a finite number of disjoint measurable subsets.
The variation so defined is a positive measure (see Rudin (1966, p. 139)) and coincides with the one defined by 1.3 when is a signed measure: its total variation is defined as above. This definition works also if is a vector measure: the variation is then defined by the following formula
where the supremum is as above. Note also that this definition is slightly more general than the one given by Rudin (1966, p. 138) since it requires only to consider finite partitions of the space : this implies that it can be used also to define the total variation on finitely-additive measures.
The total variation of any probability measure is exactly one, therefore it is not interesting as a means of investigating the properties of such measures. However, when μ and ν are probability measures, the total variation distance of probability measures can be defined as
and its values are non-trivial. Informally, this is the largest possible difference between the probabilities that the two probability distributions can assign to the same event. For a categorical distribution it is possible to write the total variation distance as follows
The total variational distance for categorical probability distributions is called statistical distance: sometimes, in the definition of this distance, the factor is omitted.
The total variation of a differentiable function can be expressed as an integral involving the given function instead of as the supremum of the functionals of definitions 1.1 and 1.2.
differentiable function , defined on an interval ⊂ℝ, has the following expression if f' is Riemann integrable
The total variation of adifferentiable function defined on a bounded open set ⊆ℝn, the total variation of has the following expression
Given aThe first step in the proof is to first prove an equality which follows from the Gauss-Ostrogradsky theorem.
Under the conditions of the theorem, the following equality holds:
From the Gauss-Ostrogradsky theorem:
by subtituting , we have:
where is zero on the border of by definition:
Under the conditions of the theorem, from the lemma we have:
in the last part could be omitted, because by definition it's considerate supremum is at most one.
On the other hand we consider and which is the up to approximation of in with the same integral. We can do this hence is dense in . Now again substituting into the lemma:
This means we have a convergent sequence of that tends to as well as we know that . q.e.d.
It can be seen from the proof that the supremum is attained when
The function is said to be of bounded variation precisely if its total variation is finite.
The total variation is a norm defined on the space of measures of bounded variation. The space of measures on a σ-algebra of sets is a Banach space, called the ca space, relative to this norm. It is contained in the larger Banach space, called the ba space, consisting of finitely additive (as opposed to countably additive) measures, also with the same norm. The distance function associated to the norm gives rise to the total variation distance between two measures μ and ν.
For finite measures on ℝ, the link between the total variation of a measure μ and the total variation of a function, as described above, goes as follows. Given μ, define a function by
Then, the total variation of the signed measure μ is equal to the total variation, in the above sense, of the function φ. In general, the total variation of a signed measure can be defined using Jordan's decomposition theorem by
for any signed measure μ on a measurable space .
Total variation can be seen as a non-negative real-valued functional defined on the space of real-valued functions (for the case of functions of one variable) or on the space of integrable functions (for the case of functions of several variables). As a functional, total variation finds applications in several branches of mathematics and engineering, like optimal control, numerical analysis, and calculus of variations, where the solution to a certain problem has to minimize its value. As an example, use of the total variation functional is common in the following two kind of problems
One variable
Several variables
Measure theory
Probability theory